NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Dense Paraphrasing for multimodal dialogue interpretation

https://doi.org/10.3389/frai.2024.1479905

Tu, Jingxuan; Rim, Kyeongmin; Ye, Bingyang; Lai, Kenneth; Pustejovsky, James (December 2024, Frontiers in Artificial Intelligence)

Multimodal dialogue involving multiple participants presents complex computational challenges, primarily due to the rich interplay of diverse communicative modalities including speech, gesture, action, and gaze. These modalities interact in complex ways that traditional dialogue systems often struggle to accurately track and interpret. To address these challenges, we extend the textual enrichment strategy of Dense Paraphrasing (DP), by translating each nonverbal modality into linguistic expressions. By normalizing multimodal information into a language-based form, we hope to both simplify the representation for and enhance the computational understanding of situated dialogues. We show the effectiveness of the dense paraphrased language form by evaluating instruction-tuned Large Language Models (LLMs) against the Common Ground Tracking (CGT) problem using a publicly available collaborative problem-solving dialogue dataset. Instead of using multimodal LLMs, the dense paraphrasing technique represents the dialogue information from multiple modalities in a compact and structured machine-readable text format that can be directly processed by the language-only models. We leverage the capability of LLMs to transform machine-readable paraphrases into human-readable paraphrases, and show that this process can further improve the result on the CGT task. Overall, the results show that augmenting the context with dense paraphrasing effectively facilitates the LLMs' alignment of information from multiple modalities, and in turn largely improves the performance of common ground reasoning over the baselines. Our proposed pipeline with original utterances as input context already achieves comparable results to the baseline that utilized decontextualized utterances which contain rich coreference information. When also using the decontextualized input, our pipeline largely improves the performance of common ground reasoning over the baselines. We discuss the potential of DP to create a robust model that can effectively interpret and integrate the subtleties of multimodal communication, thereby improving dialogue system performance in real-world settings.
more » « less
Full Text Available
Semantically Enriched Text Generation for {QA} through Dense Paraphrasing

Obiso, Timothy; Ye, Bingyang; Rim, Kyeongmin; Pustejovsky, James (October 2024, Association for Computational Linguistics)
Abbas, Mourad; Freihat, Abed Alhakim (Ed.)
Large Language Models (LLMs) are very effective at extractive language tasks such as Question Answering (QA). While LLMs can improve their performance on these tasks through increases in model size (via massive pretraining) and/or iterative on-the-job training (one-shot, few-shot, chain-of-thought), we explore what other less resource-intensive and more efficient types of data augmentation can be applied to obtain similar boosts in performance. We define multiple forms of Dense Paraphrasing (DP) and obtain DP-enriched versions of different contexts. We demonstrate that performing QA using these semantically enriched contexts leads to increased performance on models of various sizes and across task domains, without needing to increase model size.
more » « less
Full Text Available
Linguistically Conditioned Semantic Textual Similarity

Tu, Jingxuan; Xu, Keer; Yue, Liulu; Ye, Bingyang; Rim, Kyeongmin; Pustejovsky, James (August 2024, ACL)

Semantic textual similarity (STS) is a fundamental NLP task that measures the semantic similarity between a pair of sentences. In order to reduce the inherent ambiguity posed from the sentences, a recent work called Conditional STS (C-STS) has been proposed to measure the sentences’ similarity conditioned on a certain aspect. Despite the popularity of C-STS, we ﬁnd that the current C-STS dataset suffers from various issues that could impede proper evaluation on this task. In this paper, we reannotate the C-STS validation set and observe an annotator discrepancy on 55% of the instances resulting from the annotation errors in the original label, ill-deﬁned conditions, and the lack of clarity in the task deﬁnition. After a thorough dataset analysis, we improve the C-STS task by leveraging the models’ capability to understand the conditions under a QA task setting. With the generated answers, we present an automatic error identiﬁcation pipeline that is able to identify annotation errors from the C-STS data with over 80% F1 score. We also propose a new method that largely improves the performance over baselines on the C-STS data by training the models with the answers. Finally we discuss the conditionality annotation based on the typed-feature structure (TFS) of entity types. We show in examples that the TFS is able to provide a linguistic foundation for constructing C-STS data with new conditions.
more » « less
Full Text Available
GLAMR: Augmenting AMR with GL-VerbNet Event Structure

Tu, Jingxuan; Obiso, Timothy; Ye, Bingyang; Rim, Kyeongmin; Xu, Keer; Yue, Liulu; Brown, Susan Windisch; Palmer, Martha; Pustejovsky, James (May 2024, Association for Computational Linguistics)

This paper introduces GLAMR, an Abstract Meaning Representation (AMR) interpretation of Generative Lexicon (GL) semantic components. It includes a structured subeventual interpretation of linguistic predicates, and encoding of the opposition structure of property changes of event arguments. Both of these features are recently encoded in VerbNet (VN), and form the scaffolding for the semantic form associated with VN frame files. We develop a new syntax, concepts, and roles for subevent structure based on VN for connecting subevents to atomic predicates. Our proposed extension is compatible with current AMR specification. We also present an approach to automatically augment AMR graphs by inserting subevent structure of the predicates and identifying the subevent arguments from the semantic roles. A pilot annotation of GLAMR graphs of 65 documents (486 sentences), based on procedural texts as a source, is presented as a public dataset. The annotation includes subevents, argument property change, and document-level anaphoric links. Finally, we provide baseline models for converting text to GLAMR and vice versa, along with the application of GLAMR for generating enriched paraphrases with details on subevent transformation and arguments that are not present in the surface form of the texts.
more » « less
Full Text Available
GLAMR: Augmenting AMR with GL-VerbNet Event Structure

Tu, Jingxuan; Obiso, Timothy; Ye, Bingyang; Rim, Kyeongmin; Xu, Keer; Yue, Liulu; Brown, Susan Windisch; Palmer, Martha; Pustejovsky, James (May 2024, ACL)

This paper introduces GLAMR, an Abstract Meaning Representation (AMR) interpretation of Generative Lexicon (GL) semantic components. It includes a structured subeventual interpretation of linguistic predicates, and encoding of the opposition structure of property changes of event arguments. Both of these features are recently encoded in VerbNet (VN), and form the scaffolding for the semantic form associated with VN frame ﬁles. We develop a new syntax, concepts, and roles for subevent structure based on VN for connecting subevents to atomic predicates. Our proposed extension is compatible with current AMR speciﬁcation. We also present an approach to automatically augment AMR graphs by inserting subevent structure of the predicates and identifying the subevent arguments from the semantic roles. A pilot annotation of GLAMR graphs of 65 documents (486 sentences), based on procedural texts as a source, is presented as a public dataset. The annotation includes subevents, argument property change, and document-level anaphoric links. Finally, we provide baseline models for converting text to GLAMR and vice versa, along with the application of GLAMR for generating enriched paraphrases with details on subevent transformation and arguments that are not present in the surface form of the texts.
more » « less
Full Text Available
Linguistically Conditioned Semantic Textual Similarity

https://doi.org/10.18653/v1/2024.acl-long.64

Tu, Jingxuan; Xu, Keer; Yue, Liulu; Ye, Bingyang; Rim, Kyeongmin; Pustejovsky, James (January 2024, Association for Computational Linguistics)

Full Text Available
Integrated Annotation of Event Structure, Object States, and Entity Coreference

https://doi.org/10.18653/v1/2023.crac-main.9

Rim, Kyeongmin; Pustejovsky, James (January 2023, Association for Computational Linguistics)

Full Text Available
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop

Cordeiro, Silvio Ricardo; Oraby, Shereen; Pavalanathan, Umashanthi; Rim, Kyeongmin (January 2018, Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop)

This volume collects the accepted peer-reviewed papers presented at the 2018 NAACL Student Research Workshop, an event for junior researchers held as part of the larger NAACL 2018 conference.
more » « less
Full Text Available

Search for: All records